SVM Approach to Forum and Comment Moderation

نویسنده

  • Adam Maus
چکیده

Social networks, blogs, and forums bring users together to build a community usually based on a system of communication through comments. Based on the degree of complexity, these systems may or may not have a moderator whose primary purpose is to remove spam or abusive comments within these systems. When a moderator is used, it is most often a human, whose time and energy must be exerted to read each and every comment making it a tedious job for a large website. A support vector machine (SVM) approach is proposed for comment moderation. Using a training corpus obtained from the popular website Youtube.com, a support vector machine is used to classify comments as abusive or not. Baseline accuracy is found by performing 10-fold cross validation on unprocessed data. Different experiments are performed on the data by preprocessing it to find if certain variations provide a more accurate estimate.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Moderation of Comments in a Large On-line Journalistic Environment

On-line journalistic sites publish several news and stories every day. Readers of these sites may comment a story, and, as a consequence, a single story might receive thousands of comments. The quality of these comments may vary a lot, from spams and trolls to truly useful information. Separating good from bad comments is an important task, and is the primary goal of comment moderation. Moderat...

متن کامل

KeyGraph for Visualization of Discussions in Comments of a Blog Entry with Comment Scores

This paper discusses a new application of KeyGraph for visualization of discussions in comments of a blog entry in Slashdot. KeyGraph is a visualization tool for discovery of relations among text-based data. A common approach of applying KeyGraph is that of applying it to the whole data at once. In this paper, we propose an approach that applies KeyGraph successively to multiple chunks of comme...

متن کامل

How a moderated online discussion forum facilitates support for young people with eating disorders

INTRODUCTION Young people with eating disorders are at risk of harm to their social, emotional and physical development and life chances. Although they can be reluctant to seek help, they may access social media for information, advice or support. The relationship between social media and youth well-being is an emotive subject, but not clearly understood. This qualitative study aimed to explore...

متن کامل

Extracting Chatbot Knowledge from Online Discussion Forums

This paper presents a novel approach for extracting high-quality pairs as chat knowledge from online discussion forums so as to efficiently support the construction of a chatbot for a certain domain. Given a forum, the high-quality pairs are extracted using a cascaded framework. First, the replies logically relevant to the thread title of the root mes...

متن کامل

Learning to Perform Moderation in Online Forums

Online discussion forums are a valuable resource for people looking to find information, discuss ideas, and get advice on the Internet. Unfortunately, many forums have too much activity and information available, resulting in information overload. Moderation systems are implemented in some forums as a way to handle this problem, but due to sparsity issues, they are often not sufficient. In this...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009